Previs: a person-specific realistic virtual speaker

نویسندگان

  • Javier Melenchón
  • Francesc Alías
  • Ignasi Iriondo Sanz
چکیده

This paper describes a 2D realistic talking face. The facial appearance model is constructed with a parameterised 2D sample based model. This representation supports moderated head movements, facial gestures and emotional expressions. Two main contributions for talking heads applications are proposed. First, the image of the lips is synthesized by means of shape and texture information. Secondly, a nearly automated training process makes the talking face personalization easier, due to the use of mouth tracking. Additionally, lips are synchronized in real time with speech that is generated using a SAPI compliant text-to-speech engine.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Communication Task in HMD Virtual Environments: Speaker and Listener Movement Improves Communication

In this paper we present an experiment which investigates the influence of animated real-time self-avatars in immersive virtual environments on a communication task. Further we investigate the influence of 1st and 3rd person perspectives and the influence of tracked speaker and listener. We find that people perform best in our communication task when both the speaker and the listener have an an...

متن کامل

Virtual People: Capturing Human Models to Populate Virtual Worlds

In this paper a new technique is introduced for automatically building recognisable moving 3D models of individual people. Realistic modelling of people is essential for advanced multimedia, augmented reality and immersive virtual reality. Current systems for whole-body model capture are based on active 3D sensing to measure the shape of the body surface. Such systems are prohibitively expensiv...

متن کامل

Algorithms for Audiovisual Speaker Localisation in Reverberant Acoustic Environments

Innovative and future human-machine interfaces or video conference systems require knowledge of the speaker’s position for automatic beamformerand camera-steering purposes. To determine this position, acoustical as well as visual localisation techniques can be applied, and the aim of this project was to develop suitable algorithms for such an audiovisual speaker localisation. Furthermore, an ex...

متن کامل

The MultiLis Corpus - Dealing with Individual Differences in Nonverbal Listening Behavior

Computational models that attempt to predict when a virtual human should backchannel are often based on the analysis of recordings of face-to-face conversations between humans. Building a model based on a corpus brings with it the problem that people differ in the way they behave. The data provides examples of responses of a single person in a particular context but in the same context another ...

متن کامل

Study of Applicability of Virtual Users in Evaluating Multimodal Biometrics

A new approach of enlarging fused biometric databases is presented. Fusion strategies based upon matching score are applied on active biometrics verification scenarios. Consistent biometric data of two traits are used in test scenarios of handwriting and speaker verification. The fusion strategies are applied on multimodal biometrics of two different user types. The real users represent two bio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002